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I . INTRODUCTION 

The linear statistical model is a mathematical formula- 
tion which IS useful in the interpretation of data and ob- 
servations from experiments. While the assumptions necessary 
for the use of a linear statistical model restrict its appli- 
cation, it is frequently convenient and appropriate to use 
such a model because of its simplicity and its suitability 
to many processes and experiments under investigation. 

This paper will be concerned with one of the most fre- 
quently utilized linear statistical models. It is assumed 
that n observations, Y^, are made of a process or experimen- 
tal quantity. The process or experiment has p elements, , 
each of which has a fixed value for each of n replications. 
Associated with the vector of observations is a vector of 
errors, denoted by e. The errors are assumed to be uncor- 
related between replications and have a multivariate distri- 

2 

bution with mean vector 0 and variance-covariance matrix a I. 
In matrix notation, the model is expressed as 

y = X3 + e , 

where Y is an n x 1 vector of observations with mean vector 

2 

X3 and variance-covariance matrix a I. X is an n x p matrix 

of known constants, and 3 is a p x 1 vector of unknown param- 
2 

eters. a is the unknown variance of the individual observa- 
tions . 

By the use of the general linear statistical model, one 

can estimate the assumed functional relationship between Y 

and X. If the matrix X is of full rank, we accomplish this 

2 

by estimation of a and 3. In the less-than-full rank case. 
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2 

we concern ourselves with estimation of o and F'g, sets of 
linearly independent estimable functions. Several tests of 
hypotheses are frequently useful in the evaluation 
of the functional relationships in the model. Such tests pro- 
vide conclusions concerning the values of the unknown param- 
eters which would be useful in predicting values of Y or in 
explaining the variability of Y. 

This paper discusses the mathematical manipulation of 
a linear model by which conclusions regarding the process 
under investigation might be reached. The general linear 
statistical model has traditionally been treated as two sep- 
arate cases, with two different methodologies. These sep- 
arate methodologies are necessitated by two general forms 
which the matrix X may assxime. These forms are the full rank 
form, where the n x p matrix X is of rank p, and the less- 
than-full rank form where X is of rank r < p. The applica- 
tion of the general linear model to experimental design fre- 
quently gives us the less-than-full rank case, whereas in the 
regression analysis application, the full rank case is most 
often encountered. 

As an example of the apparent necessity for different 
methodologies, consider estimation of the vector of unknown 
parameters. The method of least squares yields the set of 
equations , 

X'X6 = X'Y , 

where 6 is the least squares estimator of the vector 6. If 
X is of full rank, a unique solution for 3 exists, because 
X'X has an inverse. If, however, the rank of X is r < p, a 
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unique solution does not exist, and reparametrization is com- 
monly used to estimate the invariant features of the solution 
vectors, 6. Reparametrization involves the linear transforma- 
tion of the vector S into the vector a and the consequent 
change of the model from 

Y = XB + e 
to 

Y = Za + e , 

where Z is a matrix of full rank. The mathematical manipula- 
tion of this model can then be carried out in much the same 
fashion as the manipulation of the full rank case. The lack 
of a unified methodology led the author to consider an ap- 
proach by which both the full rank and the less-than-full rank 
cases could be manipulated by the same mathematical technique. 

In the chapters to follow, manipulations of the general 
linear statistical model will be formulated in terms of the 
Moore-Penrose generalized inverse. The definition of the 
generalized inverse will be taken as the point of departure 
for Chapter II. A survey of the properties of the generalized 
inverse which are preliminary to the statistical formulation 
will be presented. 

In Chapter III the estimation of parameters and functions 
of parameters, and the distribution of the estimates will be 
discussed. The formulation and testing of hypotheses will 
be examined in Chapter IV. For a part of tne third chapter 
and the entire fourth chapter it will be necessary to specify 
a multivariate distribution for the vector of errors. We 
shall assume that e has the multivariate normal distribution; 
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2 

that is, e'^MVN(0,a I). This is a commonly assumed distribu- 
tion, as it is justifiable for observational errors in a 
wide range of processes and experiments. 

Appendix A is composed of a simple numerical example of 
the unified methodology derived in the preceding chapters. 

An experimental design application of the general linear sta- 
tistical model has been chosen to illustrate the use of the 
method in the less-than-full rank case. 

Throughout the development, it will be noted that strong 
reliance has been placed upon the notation, terminology, and 
methods of proof of Graybill (1). This is due, in part, to 
Graybill's consistency with the generalized inverse formula- 
tion. Equally important, however, is the simplicity with 
which Graybill has presented the basic statistical theory. 
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II . 



THE GENERALIZED INVERSE 



The mathematical concept known as the generalized in- 
verse of a matrix was first introduced by E. H. Moore in 1920. 
Moore (2) presented the first published systematic investiga- 
tion of the properties of the generalized inverse in 1935. 

The concept was not widely recognized until 1955 when R. 
Penrose (3) independently rediscovered it. 

The Moore-Penrose generalized inverse of an arbitrary 
matrix A has been defined by Penrose (3) as the solution, 

G = a"*" of the following four equations. 



AGA = A (2.1) 

GAG = G (2.2) 

(AG) ' = AG (2.3) 

(GA) ' = GA (2.4) 



It is convenient to describe the generalized inverse of 
an arbitrary matrix by two equations which are equivalent to 
the defining equations. These equations can be formed by 
substituting eqn. (2,3) into eqn. (2.2), and eqn . (2.4) into 

eqn, (2.1), Respectively, these relationships are 
GG'A' = G (2.5) 

and 

AA'G' = A (2.6) 

There are several properties of the generalized inverse 
which will be useful in developing the statistical aspects 
of this paper. These properties are stated below with source 
references, when appropriate, where a detailed development 
of the property may be found. 
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Property (1) ♦ The generalized inverse specified by 



the four defining equations is unique. [Penrose (3)] 



TV t+ , 

. A = A. 


[Penrose (3) ] 


. a'*’’ = a''*’. 


[Penrose (3)] 


. a'*’ = A'A'*’’ 


A^. [Penrose (3)] 


. A^A, AA"^, 


I-a"*'a, and I-AA"*" are each 


[Penrose (3)] 




. Rank (A"*”) 


= rank (A) = ranJc (a'^a) = 


: trace (a"'’a) . 


[Penrose (3)] 



Property (7) ♦ (AB)'*’ = if, and only if, a"*’a 

4. 

and BB' commute, and A' A and BB commute. [Greville (4) ] 

Property (8) . A necessary and sufficient condition 

that AXB = C has a solution for X is AA'^CB"*’b = C. 

+ + + + 

The general solution isX=ACB +Z-A AZBB , where 
Z is arbitrary. [Penrose (3) ] 

Property (9) . If A is an n x p matrix of rank p. 

(A'A)'*’ = (A'A)“^. 

Proof: (A ’A) has an inverse, for it is a p x p matrix 
of rank p. It may easily be verified that (A'A) ^ satisfies 
the defining equations for the generalized inverse of (A'A) . 
As the generalized inverse is unique, (A'A)"^ = (A'A) 

Property (10) . If A is an n x p matrix of rank p, 
a'*'a = I. 



Proof: We know from matrix theory that if A is an 

n X p matrix of rank p, then A has a left inverse. It can 
be shown by substitution into the defining equations that 
the left inverse if a"^. The right inverse result for a 

-f- 

p X n matrix of rank p is, similarly, AA = I. 
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Property ( 11 ) . If a matrix A is of full rank and 
is partitioned such that A = (A^ • A 2 ) then 

^1 ■ t (I-A^ApA2l'^ 



A'^ = 



[ (I-A^A^)A2]'^ 



[Cline (5)] 



In particular, it is noted that when A^^'A 2 = 0, then 

+ ' 



A+ = 



since A^A 2 = (A^A^ ' A^^ ' ) A 2 = 0. 
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III. ESTIMATION 



Making the assiamptions for the general linear statis- 
tical model , 

Y = X3 + e, 

2 

where E(e) = 0 and E(ee') = a I, provides us with a basis for 
estimation of the vector of unknown parameters and functions 
of the parameters. Estimation of these quantities is equiv- 
alent to estimation of the relationships which exist between 
the observations, Y, and the matrix of known constants, X. 

As previously indicated, using the method of least 
squares to minimize (Y-XB) ' (Y-X3) yields the normal equations, 

X'X6 = X'Y. 

Regardless of the rank of X, properties (4) and (8) give us 
a general solution for the least squares estimator of B, 

B = (x'x)’^x'Y + [i-(x'x)’^(x'x) ]z = x''‘y + (i-x‘''x)z, 
where Z is an arbitrary p x 1 vector. It is apparent that 
if X is of full rank, then by properties (9) and (10), 

3 = x‘'‘y = (X'X) ”^X' Y, 

which confirms the result derived in Chapter I. However, we 
will not set the full rank case apart, for the general solu- 
tion holds for rank (X) = r ^ p. 

Least squares estimation does not directly provide an 

estimate of a . However, by basing our estimate of a on B , 

2 

an unbiased estimate of a is 

-2 ^ (Y-XB) ' (Y-XB) ^ Y' d-XX'*') Y 
n - r n - r 

Of immediate interest is the mean vector and variance- 

/N 

covariance matrix of the estimator B. Recalling that Y has 
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2 

mean vector XB and variance-covariance matrix a I, 

E(3) = E [x‘‘‘y+ (I-X‘''X) Z] = X'*'XB + [i-x‘‘‘x]z 

and 

Var(B) = E [ (X‘^Y-X'*’XB) (x‘'’y-X'‘’xB) ' ] = E [x'^ee ' X'*' ' ] = a^(X'X)‘‘‘. 

/N 

It is noted that 6 is not, in general, an unbiased estimator. 

However, if the vector 3 happens to equal our choice of the 

arbitrary vector Z, the estimator 6 is unbiased. 

If we now specify the distribution of the vector of 

2 

errors such that evMVN(0,a I) , it can be shown that the 

method of maximum likelihood estimation leads to precisely 

the same results as the least squares estimation of B. 

2 

By assuming that E(e) = 0 and E(ee') = a V, where V is 
an arbitrary positive definite matrix, the estimation can be 
somewhat generalized. The normal equations stemming from 
least squares estimation in this more general case are, 
(X'V"^X) S = X'V"^Y, 
with solutions , 

B= (X'V“^X)'''X'V"^Y + [I-(X'V"^X)‘‘‘(X'V"^X) ]Z. 
Furthermore , 

E(B) = (X’V"^X)'^(X'V"^X)3 + [I-(X’V"^X)‘''(X'V"^X) ]Z 
Var(B) = E[(B-E(B)) (B-E(B)) '] 

= E[ (X'v“^X)'^X'v"^ee'v"^X(X’v“^X)'^] 

= (x'v"^x)‘''(x'v“^x) (x'v"^x)'*'a^ = a^(x'v"^x)"''. 

In each of the distributional cases discussed above, the 
estimator of B is not unique if the rank (X) = r < p. In the 
less-than-full rank case no direct inference can be made con- 
cerning all the values of the elements of B. However, there 
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are invariant quantities in any solution which are called 
estimable functions. 

Consider a linear combination of the elements of 6, de- 

/V. 

noted by f'3. f'3 is invariant with respect to the arbitrary 

Z if, and only if, 

f [I-X‘‘’X] = 0. 

If X is of full rank, then this relationship holds for all 
f ' . In the less-than-full rank case, all linear combina- 
tions of the form where f = b'X have this property since 

f [I-X'^X] = b' [X-XX'^X] = 0. 

Furthermore, since the nullity of [I-x"^X] is r, any basis for 
the null space of [I-x'*'x] is a set of r linearly independent 
solutions to f [I-x'*'x] = 0. Since the rank of X is r and 
b'X[I-x"*'x] = 0, all such vectors f can be written f = b'X. 
We may define a linearly estimable function in terms of this 
invariance property; that is, a linear function of the param- 
eters, f'3, is estimable if, and only if, f (I-X"^X) = 0. 
Graybill (1) defines a linearly estimable function as a 
function of the unknown parameters for which there exists a 
vector b' such that 

E(b'Y) = b'X3 = f'3. 

Clearly, the two definitions are equivalent. Graybill fur- 
thermore proves that it is equivalent to say that a linearly 
estimable function is a linear combination of the parameters 
for which a solution for r exists to the equation 

f = r'X'X. 

It is this form of the estimable function which will be most 
useful to us. 
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Among other results which Graybill has developed with 
applicability in the generalized inverse formulation are 
the three listed below: 

(1) . The best linear unbiased estimator of an 

estimable function, f'3, is 

/\ 

f'3 = f'6 

(2) . The functions, f|B, ..w are called 

linearly independent estimable functions if each 
of the functions is linearly estimable, and f|, 
f^r ..., fj^ are linearly independent vectors. 

(3) . A set of linearly independent estimable 
functions contains, at most, r functions, where 
r = rank (X) . 

A linearly estimable matrix function, F'B, is a set of 
m linearly independent estimable functions, 1 _< m _< r. A 
matrix function may be represented as 






^2 



F' B = 



(B) = (R'X'X)B 



where the f|B are linearly independent estimable functions. 

A result which will be frequently utilized is that the 
estimator of an estimable matrix function is invariant with 
respect to the arbitrary matrix Z, which follows directly from 



the fact that each component of F'B has this property. 
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2 

Considering the distributional case where e'^^MVN(0,a I), 
the estimator of an estimable matrix function is always 
unique and, therefore, of use in making inferences regarding 
the values of the estimable functions. The mean vector and 
variance-covariance matrix of our estimator are, respectively, 
E(F'B) = E(F'X'^Y) = R'X'(XX‘^X)6 = F'3 

and 

Var(F'B) = E [ (F' X'^Y-F' 3) (F'X‘'’y-F' 6) '] = E (F ' X'^'ee ' X"^ ' F) = O^F'F. 
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IV. TESTS OF HYPOTHESES 



We shall now discuss a test of the hypothesis F'6 = C, 
where F'B is an m x 1 vector of linearly independent estim- 
able functions. F'B = C is called a linearly estimable 
hypothesis if F'B is a set of linearly independent estimable 
functions and C is a vector of known constants. To illustrate 
this concept, suppose we desire to test the equality of the 
first three components of B; that is. 



Hq:Bi ~ ^2 ~ ^3* 



If the functions. 







f|B =(1-10 


0* • *0) B 




and 




f^B =(11-2 


0- • -0) B 




are 


linearly 


estimable, then 










F'B = (fi,f^) 'B = 


■^1 -^2 


= 0 








Bi +B2 -2B3 




if. 


and only 


if, Bj_ = B2 = B3. 


We require that the 



estimable functions of the hypothesis be independent. To 
have any of these not independent would be redundant. For 
example, in the illustration above. 



f^B = (0 1 -1 0 - • - 0 ) B = 0 

would test the equality of ^3/ which is already being 

tested by f|B and f^B. 

In the full rank case, a consequence of the form of the 
linearly estimable function and the estimable hypothesis is 
that the elements of B, individually, and as a complete set, 
are estimable. The hypothesis that B = C is then estimable. 
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Furthermore, any subset of the elements of 6 equal to a 
specified set of constants may be tested by choosing the ap- 
propriate rows of the p x p identity matrix as the matrix F'. 
In developing the likelihood ratio test criterion, we 

will be concerned only with the normal theory case where 
2 

e'vMVN( 0 ,a I). The appropriate likelihood function is then 



f(e;3,a^) = ^ 



T 2.n/2 
2ira ) ' 



exp 



- (Y-X6) ' (Y-X 3 )' 



2a' 



To develop a test of the hypothesis Hq; F '3 = C versus the 
alternative, H^:F '3 7^ C, we will utilize the likelihood 
ratio , 

A _ L (to) 

y - — ^ • 

L 



L(fJ) is the maximum value of the likelihood function with 

2 

the parameters contained in the (3/cr ) p + 1 dimensional 

/N 

space, which is unrestricted. L(to) is similarly the maxi- 
mum value of the likelihood function with the parameter 
space restricted by Hq. The to space is p - m + 1 dimen- 
sional because values for m independent relationships among 
the elements of 3 are specified by the hypothesis. The maxi- 
mizing values of the parameters in the to space shall be de- 
~ ~ 2 

noted by 3 and o , while the corresponding values xn the 

/s 2 

space shall be denoted by 3 and a . 

In the restricted parameter space we desire to maximize 
2 2 

f(e;3,a ) with respect to a and 3 subject to the restraint. 



F' 3 = C. 



Let 

(j)(e;3,a^) = log^ [f (e ; 3 ,o^) ] . 
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It follows that 



d<i> _ (Y-XS) ' (Y-Xb) n _ ^ 

2 '•4 ~2 ^ ' 

do 2a 2a 

2 

The resulting maximizing value of a for any given 6 is 

~2 ^ (Y-XB) ' (Y-X3) 
n 

Examining the likelihood function it is noted that the 
maximum value of the likelihood function with respect to B 
with a^ = o^ = (Y-XB) ' (Y-XB)/n occurs when (Y-XB) ' (Y-XB) is 
a minimum. However, we must constrain B by the relationshio , 
F'B = C. The problem of finding the value of B which maxi- 
mizes the likelihood function within the hypothesis constraint 
can then be formulated as the quadratic program, 

min (Y-XB) ' (Y-XB) , 
subject to: F'B = C. 

The derivatives of the Lagrangian of the objective function 
are : 

= -2Y'X + 2B'X'X - 2X'F' = 0 (4.1) 

~ = F'B - C =0, (4.2) 

which are the necessary and sufficient conditions for a mini- 
mization where 2X' is the appropriate vector of Lagrange 
multipliers. We first premultiply eqn. (4.1) by AX'*"', 
yielding 

-AY + F'B- AA' X = 0. 

Substituting C, from eqn. (4.2), for F'B^ the solution for X 
is 

X = (AA' )‘‘‘C - a‘‘‘'Y. 
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Returning to eqn. (4.1) with this value of X, and solving 
for 3, yields 

3 = (X'X)‘^X'A' (A'‘’’a‘''c-A''''y) + X‘*‘X''^X'Y + (I-X‘‘‘x)0 
= x'^’a'^C-x'^a'^AY + x’''y + (I-X‘*'X)Q, (4.3) 

where Q is an arbitrary p x 1 vector. 

Since 

-2X' Y+2X’X[x'^a‘^C-X'^a'^AY+x‘'‘y+(I-X'^X)Q]-2X'A' [ (AA* )'*’C-a‘'’' Y] = 0, 
-x'y+x'a‘*'c-x'a‘'’ay+x'y-x'a'^c+x'a'a‘^'y = 0, 

clearly, eqn. (4.1) is satisfied by 3 and X. Also 
F’3 = r'x'x[x‘^a‘^c-x‘*‘a'^ay+x'''y+(i-x'^x)q] 

= aa‘^c-aa‘'‘ay+ay. 



since A is an m x n matrix of rank m, 

AA"*" = I, 



and eqn. (4.2) is also satisfied. 

Returning to the likelihood function and substituting 



a , while retaining the general term, 3, we find that 

L(w) = max 
2 

a eio 
F' 3 = C 



((e,p,o2) = n"/^exp[-n/21 



(2tt)^/^ [ (Y-X3) ' (Y-X3) 



2 

Finding the maximizing values of a and 3 in the unre- 
stricted parameter space yields the results, 

3 = x'^Y + (I-x‘''x) Z (4.4) 

i2_ (Y-X3) ' (Y-XS) 

a , 

^2 . 

where Z is an arbitrary p x 1 vector, and a is a biased 
2 

estimator of a . Combining these results in the likelihood 
function yields the maximum value. 
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L ( n ) = max f ( e ; B / 0 ^ ) 

2 o 

0 efi 

Ben 



n^^^exp [ -n/2] 

[ (Y-XB) ' (Y-XB) 



The likelihood ratio is then 



L(w) _ 


"(Y-XB) ' 


' (Y-XB)‘ 


L («) 


.(Y-XB) ' 


' (Y-XB) . 



1 n/2 

9 = 

where 6 and B are as presented in equations (4.3) and (4.4) 
respectively. 

To develop our test criterion we will first manipulate 
the quadratic forms of the likelihood ratio to form chi- 
square random variables. The independence of the quadratic 
forms will then be demonstrated and the test criterion estab- 
lished as a non-central F statistic. 

For any B» 

(Y-XB) ' (Y-XB) = (Y-XB) ' (Y-XB) + (B-6) ' (X'X) (B-e) . 

As a consequence, the likelihood ratio can be written as 

n/2 



0 = 



' 


1 


1+(B-B) ' 


x'x(g-B) 


(Y-XB) 


' (Y-XB) 



(4.5) 



Let us now examine the ratio in the denominator. Using 

/s ~ 

the solutions for B and B, it follows that 
(g-B)X'X(B-B) 

= (xx"''y-xx‘^a‘''c+xx'''a‘''ay-xx‘*'y) ' (xx'^y-xx'^a‘^c+xx‘*'a'’'ay-xx'^y) 
= (xx'''a'^ay-xx'^a'‘'C) ' (xxVay-xx'’'a'^c) 

= (AY-C) ’ (A'^’XX'^a'^) (AY-C) . 

By the definition of a non-central chi-square (x^"") 

2 

random variable, if Z^^MVN (y , V) , then Z ' (q,l)/ where 

q is the rank of BV and X = y' By/2, if, and only if, BV is 
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idempotent. Then if [ (a'*' ' XX''’a'^) / a^ ] [ Var (AY-C) ] is idempotent, 

/S 'N. ^ 2 

(6-S) *X *X (6-3) /a has a non-central chi-square distribution. 
Since Y'VMVN (X6 ,a^I) , (AY-C) '^/MVN (AXg-C ,a^AA' ) and 

BV = 

We must determine whether, or not, a"*” ' XX"''a''’aa ' is idempotent. 
It is clear that 

a'^'XX'^ (A'^AA* ) = a'^'XX‘'’(A' ) = A^'XX'^(XR), 
where R'X'X = AX. Therefore, 

a"*" ' xx‘'’a‘''aa ' = a‘‘''XR = a'^'A'. 

Since A' is a p x m matrix of rank m, it has a left inverse, 
a"^ ' , and 

BV = a'*' ' XX'^A'''AA ' = I. 



Hence, BV is idempotent and of rank m. We can therefore 
conclude that (AY-C) ' (a"^ ' XX"^a"*") (AY-C) /a^ has the non-central 
chi-square distribution with m degrees of freedom and non- 
centrality parameter, X = (F' 3-C) ' (a"*"' XX'*’a''’) (F' 6-C) /2a^. 

The quadratic form, (Y-X§) ' (Y-X§) is equal to Y'(I-XX'^)Y. 
Since (I-XX^) is idempotent of rank n-r, (Y-XB) ' (Y-X§) has 
the central chi-square distribution with n-r degrees of free- 
dom. Consequently, our likelihood ratio is 

n/2 



6 = 



1+1 



(4.6) 



where (F* B-C) ' (a'^’XX'^a'^) (F' 3-C) /2a^) and the variable 

5'VX^ (n-r) . 

In general, two quadratic forms T'DT and T'ET are inde- 
pendent if, and only if, DVE = 0, where TvMVN(ia,V). 
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The identities, 

Y = (3-6) 'X'x(3-3)/a^ = (Y-X3 ) XX'^ (Y-X3) /a^ , 

and 

? = (Y-X3) ' (Y-X3)/a^ = (Y-X3) ' (I-XX’*') (Y-X3)/a^ 
are convenient forms for demonstrating the independence of 
T and ^ . Since 

(Y-X3) = (i-xx'*'+xx'*'a'^a) y-xx'^a'''c , 

2 

and the variance of Y is a I , then 

Var(Y-X3) = ( I-XX'^+XX'^a'^'a) ( I-XX'^+XX‘‘'a‘^A) ' = ( I-XX'''+A'‘'a) . 

The quantity corresponding to DVE is then 



^X'X^ 


(1-XX‘^+A'^A) 


'i-xx'^^ 


2 


2 


J 




0 

1 J 



as can be easily verified. 

The ratio of the non-central chi-square variable to 
the central chi-square variable in the denominator of our 
likelihood ratio with each variable divided by its degrees 
of freedom has the non-central F distribution with m and n-r 
degrees of freedom and non-centrality parameter, 

A= (F’ 3-C) ' (a‘'’'XX'^a‘'^) (F* 3-C) . 



A necessary and sufficient condition that A = 0 is that the 
hypothesis is true; that is, F'3-C =0. In this event, the 
ratio of the quadratic forms divided by their degrees of 
freedom has a central F distribution. 



Y (n-r) _ (AY-C) ' (A ' ~^XX~*'a'^) (AY-C) 



C (in) 



vp 



(Y-X3) ' (Y-X3) 



m,n-r 



Since rejection of the hypothesis is consistent with small 
values of the likelihood ratio and the likelihood ratio is 
monotonic decreasing in [y (n-r ) ] / [ 5 (m) ) ] , the critical region 
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for this test is 



Y (n-r) 

C (m) 



F 

m,n-r 



(a) 



/ 



where (a) is the upper significance point correspond- 

ing to significance level a. 

As substantiation of our likelihood ratio criterion, we 
shall compare it with commonly used criteria for two types 
of tests in the full rank case. The common form of the full 
hypothesis test criterion for 3=6*/ Graybill (1) for exam- 
ple, is 

n-p (Y-X3*) (X(X'X)~^X') (Y-X3*) . „ , . 

P (Y-XX Y) ' (Y-XX Y) P 



It is apparent that the denominators are equal. Furthermore, 
(Y-X3*) ' (X(X'X) ”^X' ) (Y-X3*) = (Y-X3*)'(XX'^) ' (XX"*") (Y-X3*) 

= (x‘*'y-3*) ' (X'X) (X'^Y-B*) . 



Therefore the test criteria are equivalent in the full 
hypothesis test. 

To test that s of the elements of 3 are equal to known 

constants, while the other p - s elements are unspecified, 

Graybill (1) uses the subhypothesis criterion, 

^ ^ (Y-X^Yi*) ' (X(X'X)"^X'-X„(X,'X,)"^X ') (Y-X^Yi*) 

^ ^-4—^ > F _ (a) . 

® (Y-X^Y^*) ' (I-X(X'X) -^X' )(Y-X^Yi*) s,n p 



For the purpose of this full rank test, the elements of the 
general linear model have been partitioned such that 

X = (Xj_,X2) , 3' = (Yi,Y2) 

and the hypothesis is Yj_ = Yj_*/ where Yq^* is an s x 1 vector 
of known constants. 
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In the generalized inverse formulation this hypothesis 
may be tested by setting C = and choosing F' of the 

form F' = [I ; 0] , where the identity matrix is of dimen- 
sion s and the null matrix has dimensions s x(p-s) . As a 
simplification we shall consider only the full rank case 
where y^ is orthogonal to y 2 ; that is, ^^^2 “ Since 

F' = AX = [I ; 0] , we must have 



AX^ = I 
AX 2 = 0. 



By property (11) of Chapter II, it is then clear that 






A = Xi . Our hypothesis is then F'B = the test 



criterion is 






n-p 



(XiY-yi*) ' (X^'XX^X^) (X^Y-yj^*) 
(Y-X6) ' (Y-X6) 



> F^ ^ (a) 

s ,n-p 



Again it is apparent that the denominators are equal. 
The numerators are also equal since 

(Y-Xiyi*) ' (X(X'X)"^X'-X2(X2'X2)"^X2') (Y-X^y^*) 

= (Y-Xj^y^*) ' (XX'"-X2X2'^) (Y-Xj^y^*) 

= (Y-X^y^*) ' (X^X^"^) (Y-X^y^*) 

= (Y-X^y^*) •Xj_‘^(Xj_XX'^Xj_)X^'^ (Y-Xj_y^*) 

= (X^Y-y*) ' (Xj^XX'^X^) (X^Y-y^*) . 

The equivalence of our test criterion to the commonly 
used criterion in this special case of the subhypothesis 
test is then clear. 
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APPENDIX A 



Estimation and Hypothesis Testing - An Illustration 

Consider a completely randomized experimental design 
model, where 

Y.,=y+i,+e.. i=l,2,3 

j = 1,2 

We shall suppose that the required normality assumptions 
are justifiable for our statistical model, 

Y = X3 + e 



where 

e MVN (0 ,a^I) . 

The necessary elements of our statistical model are: 
Observations Design Structure 



1 




'l 1 0 o' 


3 




y 




110 0 


2 








10 10 




3 = 


X 


X = 




2 




^2 




10 10 


8 




”^3 




10 0 1 


10 




10 0 1 



The rank of X is r = 3, By solution of the equations de- 
fining the generalized inverse. 



111111 



X"^ = 1/8 



3 3-1-1 -1 -1 

-1-1 3 3 -1 -1 

-1 -1 -1-1 3 3 
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and (X'X) = 1/64 



6 2 2 2 
2 22 -10 -10 
2 -10 22 -10 
2 -10 -10 22_ 

As this is an example of the less-than-full rank case, 
the estimate is not unique. The estimator has the general 
solution , 

e = X'^Y + [I-X'^X]Z 
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2 


-2 


-2 


-2' 




-10 




-2 


2 


2 


2 


1/8 


-10 


+ 1/8 


-2 


2 


2 


2 




46_ 




-2 


2 


2 


2^ 



The mean vector of 3 is 
E(3) = X‘‘'X3 = [I-X‘‘‘X]Z 



= 1/8 





6vi+2t^+2t2+2t3 




• 2Z^-2Z2-2Z3-2z; 






2y+6T j^-2t2~2t3 




— 2Zj^+2Z2“^"2Z24'2Z^ 






2ij-2t^+6t2-2t3 


— 2Z^4-2Z2“^"2Z2"^'2Z^ 




L 


2p-2t^-2T2+6t3 




— 2 Z j^+2 Z 2“^^ ^ ^ 4 





and the variance-covariance matrix of 3 is 

3 111' 



Var (3) = a^(X'X)'^ = a^/32 



1 11 -5 -5 
1 -5 11 -5 
1 -5 -5 11 



Furthermore, the unbiased estimate of a is 



-2 ^ (Y-X3) ' (Y-X3) ^ 4/3 
n-r ' 
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The estimator of 6 in the less-than-full rank case by 
itself does not tell us anything about the individual com- 
ponents of 6 . However, consider the estimate of a function 
of parameters , where 



and 



R' = fO 1/2 -1/2 O] , 

^0 1/2 1/2 -ij 

A = R'X' = '1/2 1/2 -1/2 -1/2 

1/2 1/2 1/2 1/2 



0 

-1 



N 

0 

-1 



/ 



F' = AX = 



'O 1-1 0\ 

^0 11 -2 



The estimator of the estimable matrix function, F'S, is 



F' § = FX‘*'y 




with mean and variance-covariance matrix 



E (F* 3) = F' 3 = 



^1 ^2 



Var (F' 3) = a^F'P = 



2 0 
0 6 



It is commonly desired to test the equality of all the t^, 
the treatment effects. In this event, the corresponding 
estimable hypothesis is 



F'3 = 


• — 1 
O 


-1 


r 

o 


3 = 


Tj 


II 


0 

V 




[0 1 


1 — 1 


- 2 ^ 




1 T 1 +T 2 - 2 T 3 




loj 
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For this estimable hypothesis, the generalized inverse of 
A' is 



' - 



1/2 1/2 - 1/2 - 1/2 0 0 
1 ^ 1/6 1/6 1/6 1/6 - 1/3 - 1/3 

The test statistic, 

(AY-C) ' (A'*''XX'*'a'^) (AY-C) ^ 
(Y-X§) ' (Y-X6) 

The critical region for this test is 

r > 

n-r 



fn-rl 


Y _ 


c ^ 

n-r 


1 m J 


5 


m J 



m J 



1 > F 



2,3 



(. 05 ) = 9 . 55 . 



We must therefore reject the hypothesis, • 



24 . 5 . 



II 

1 " "^2 ^ 3 ’ 
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